ABSTRACT

Three procedures are examined that educational
researchers can use to analyze quantitatively the extent to which
modifications of specific teacher behaviors lead to changes in
student achievement. The task requires decent descriptions of the
behaviors, appropriate statistical tools and measures of student
outcomes that are worthwhile examining. The data, collected in the
course of an evaluation of the relationship between the
organizational structure of schools and student achievement, is not
entirely appropriate for the issues dealt with here. Therefore, paths
are suggested that this research might take in the future: (1) it is
possible, using multiple regression analysis, to construct models of
teacher behavior that reflect the view that some set of teacher
behaviors affect student outcomes; (2) these models can be
applied in at least three distinct ways, depending on the kinds of
assumptions that one is willing to make about the data and the kinds
of questions that one wishes to address. (Author/BC)



TOWARDS A MODEL OF TEACHER BEHAVIOR 
AND STUDENT ACHIEVEMENT 



by 

Barry D. Anderson 
Judith Kaplan 

Washington University 
St. Louis, Missouri 



I. Introduction 

In this paper we will examine three procedures that educational 
researchers can use to analyze quantitatively the extent to which modifi- 
cations of specific teacher behaviors lead to changes in pupil achievement. 
This is not a simple task. It requires decent descriptions of the behaviors, 
appropriate statistical tools and measures of student outcomes that are 
worthwhile examining. Our data, collected in the course of an evaluation
of the relationship between the organizational structure of schools and
student achievement, is not entirely appropriate for the issues we wish to
deal with in this paper. Therefore we can only suggest paths that this 
research might take in the future. 

Our major arguments are as follows: 

1. It is possible, using multiple regression analysis, to construct 
models of teacher behavior that reflect the view that some set of teacher 
behaviors affect student outcomes. 

2. These models can be applied in at least three distinct ways, depending
on the kinds of assumptions that one is willing to make about the data and 
the kinds of questions that one wishes to address. 

Following a brief description of the sample and measurement instruments 
we will examine each of these issues in turn.

II. Procedure

a) The Sample 

The schools participating in our study are located in a suburban
community. The school district is racially mixed (approximately 11% black)
and represents a range of socio-economic backgrounds. As a consequence of
de facto segregation, most of the schools are fairly homogeneous on both
racial and social class indices.

A sample of 14 fifth grade classrooms was chosen from seven of the
elementary schools in the district. The classes were selected to represent
both a variety of social class backgrounds and the spectrum of classroom
organizational styles. Questionnaires were administered to the students in
these rooms on three occasions. The data for the analysis of student
outcomes in year one is taken from the first two administrations of the
questionnaires, September, 1971, and May, 1972. The analysis of the second
year data uses the test scores collected in May, 1972, as pre-test measures
and those collected in May, 1973, as post-test scores.

As with most longitudinal studies there was some attrition due to
students moving out of the district, illness, etc. Our final sample (N = 187)
included only those students for whom there was complete data on all
variables. Several equations were run with less stringent requirements, but
they do not differ in any significant way from those reported in this paper.

b) The Measures^1
i. Achievement

The achievement measure selected for use in this study was math. The
arithmetic scale consists of 36 items from the CTBS tests of Computation,
Concepts and Applications.

ii. Affective Outcomes 

The current trend in the educational literature is to attend not only
to the strictly cognitive outcomes of schooling, but also to consider
outcomes that are more appropriately considered affective.

^1 Other measures of cognitive and affective development were collected
for use in the original study. They are not included in the present analysis.

A measure of satisfaction with school, adapted from Brayfield and
Rothe's (1951) Index of Job Satisfaction, was administered to the students.
With appropriate modifications in the wording of the items, 8 of the
original 18 items were included in the final scale. The others were deleted
because of low factor loadings and/or the difficulty of translating the item
from one context to another. A factor analysis indicated that the eight
items loaded on a single factor.

iii. Teacher Behavior

A questionnaire, adapted from Schafer's (1965) Children's Report of
Parent Behavior Inventory, was administered to the students to assess their
perceptions of their teacher's behavior.^2 The adaptations consisted of
deleting items that did not pertain to a classroom setting and changing the
word "parent" to "teacher" in those items that were appropriate. Following
a factor analysis of the first round of data, 34 items were retained for
inclusion in the remaining administrations of the scale. Thirty of these
items loaded on two factors that we call Carping Criticism (cf. Henry, 1963:
302 ff.) and Warmth. The remaining four items constituted a third factor,
Freedom or Autonomy. On the assumption that students would require some
time to develop stable perceptions of their teacher's behavior, the measures
of teacher behavior are derived from the May, 1972, administration of the
Schafer scale for year one and the May, 1973, administration for year two.
Individual perceptions of teacher behavior were averaged over all students
in a room; the resulting measure was used as a description of teacher behavior
for that room.

^2 In addition, student observers were trained in the use of Soar's (1966)
schedule of teacher behavior. The observational data is not included in this
analysis.



Schafer reports on the validity and reliability of the original scale.
In our own analysis, the communality estimates from factor analysis serve as
lower bound estimates of reliability and construct validity. The
coefficients are comparable to those obtained in other studies that have
used factor analysis as a means of validating instruments (e.g., Punch, 1967).

III. Devising an Appropriate Model

In general, theories about teacher effects state that teacher behavior
affects the academic performance of students. From this very general view,
we feel that the appropriate test of whether or not (or to what extent)
teacher behavior does affect children is to consider a mixed level analysis:
individual student background measures are examined in conjunction with
aggregate measures of the classroom environment (teacher behavior) in the
production of individual student outcomes.

Economists refer to the problems related to changing levels of analysis
as problems of aggregation (when individuals or groups are lumped together)
and dis-aggregation (when they are separated). In sociology, the seminal
treatment of the problem appears to be Robinson's (1950) paper on "Ecological
Correlations and the Behavior of Individuals." The authors of the OEO Report
(Equality of Educational Opportunity) do not deal directly with the level of
analysis issue. The design of their study, however, indicated an awareness
of the need to employ multi-level models in the analysis of the schooling
process. Coleman et al. refer to the design as a "two level" regression
analysis. We found no systematic treatment of the problem in the education
literature; we are fairly certain that the particular methodological issue
does not play a large part in the design and reporting of studies of teacher
behavior.

The appropriate model, then, should describe much more than mean
differences between rooms. It should also describe the effects of various
dimensions of teacher behavior on individual student outcomes while at the
same time considering the characteristics of the individual student. To do
this requires a modelling technique capable of handling individual and
classroom level variables simultaneously. Regression analysis, which is the
most general form of the analysis of variance as commonly used in educational
research, meets these specifications.

There are good reasons for researchers to be concerned with the unit of
analysis issue, particularly the level of aggregation on the "output" side
of the equation. First, multi-level analyses of the schooling process
correspond to our general impressions of what the classroom teaching process
is all about. Second, to study only classroom means is to invite misleading
answers.

To illustrate the latter point we draw on a paper by Rosenshine and
Furst (1971), "Research on Teacher Performance Criteria." In their attempt
to synthesize some of the research on teacher effectiveness, the authors
utilize the rubric "process-product" studies to describe ". . . investigations
which attempt to relate observed teacher behaviors to student outcome
measures." These studies are correlational in nature.

The correlational studies cited by Rosenshine and Furst appear to be
based on classroom means. A teacher behavior is noted, mean classroom
learning is computed, and correlations between the teacher behavior and mean
student learning are calculated over a sample of classrooms. The correlation
(squared) can be interpreted as the amount of variance in "mean" learning
associated with variance in teacher behavior. A correlation of .70 between
teacher clarity and mean pupil achievement indicates that 49% (.70 squared)
of the between room variance in student achievement is attributable to teacher
clarity. However, if only 20% of the variance between students is attributable
to rooms, then only 9.8% (.49 x .20) of the variance in individual outcome
scores is attributable to teacher clarity. In correlational terms, the
coefficient would drop from .70 to .31 (the square root of .098) by changing
from rooms to students as the unit of analysis. Statistically significant or
not, such small relationships would probably not be regarded as important by
practitioners faced with the task of improving the scores of individual
students. Very simply, using classrooms as the unit of analysis will affect the
value of correlations between environmental variables and student outcomes,
usually resulting in over-estimates of the size of environmental effects.
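
The arithmetic in this illustration can be checked with a short sketch. The function name is introduced here for illustration only, and the calculation assumes, as the example does, that the teacher behavior varies only between rooms, so that it can account for no within-room variance.

```python
import math

def individual_level_r(room_level_r, between_room_share):
    """Correlation implied at the student level by a correlation
    computed on classroom means (illustrative helper, not from the paper)."""
    # Share of the between-room variance associated with the behavior.
    between_room_r2 = room_level_r ** 2
    # Share of the total (student-level) variance associated with it.
    individual_r2 = between_room_r2 * between_room_share
    return math.sqrt(individual_r2)

# Figures from the example: r = .70 on room means, 20% of variance between rooms.
r_students = individual_level_r(0.70, 0.20)
print(round(0.70 ** 2 * 0.20, 3), round(r_students, 2))  # 0.098 0.31
```

The smaller the share of variance that lies between rooms, the more a room-level correlation overstates the student-level relationship.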

IV. Uses of Regression Analysis 

The choice of regression analysis does not completely resolve the
methodological issues involved in modelling teacher behavior. As Coleman
(1972) notes in his provocative paper, "Integration of the Social Sciences
through Policy Analysis," there are at least three different ways in which
the technique can be used.

Coleman reviews the various approaches, with particular emphasis on the
assumptions that underlie their use. Briefly, these include the use of
regression analysis (1) to estimate the parameters or coefficients for a
well specified model with known structures; (2) as a technique for uncovering
the causal structure in a set of variables when some prior assumptions can
be made about the causal relations among them (path analysis); and (3) as a
procedure for partitioning the regression sum of squares in instances where
no causal model can be specified and errors of measurement and collinearity
are thought to be prevalent. It is apparent that each of these successive
uses of regression analysis requires less stringent assumptions about the
structure of the proposed model and the crudeness or sophistication of the
measures employed. We will examine the use of methods one, parameter
estimation, and three, variance partitioning, in the analysis of teacher
behavior and student achievement.

a) Partitioning of Variance 

The use of variance partitioning procedures requires relatively few
assumptions about the structure of the linear regression model. In fact,
all that one need assume is that the direction of "causality" is from the
independent to the dependent variables (Coleman, 1972). The crudeness of
the measures generally employed in educational research is a rationale for
adopting variance partitioning techniques. Mood (1971), for instance,
groups the variables he works with into broad factors, on the assumption that
the individual measures that he employs are first, inaccurate (measurement-
wise) and second, are only proxies for the variables that he is considering.
For instance, he subsumes under the general factor of "peer quality" such
measures as parental expectations for higher grades, hours of homework,
plans to go to college, etc. Also, for researchers concerned with the
location of bases for implementing change in educational institutions,
variance partitioning techniques are useful for identifying independent
(orthogonal or uncorrelated) factors.

The purpose of variance partitioning is to determine what part of the
explained variance can be attributed "uniquely" to each of the independent
variables and that part which is due to collinearity among the independent
variables. One way to conceptualize the procedure is to regard it as an
attempt, first, to estimate the amount of variance in a dependent variable
which is attributable to an independent variable over and above the
variance attributable to other variables in the set, and second, to
estimate the amount of variance in the dependent variable which is shared
among the predictors.

Our analysis is based on the procedures outlined by Mood (1971),
although interested researchers should investigate competing techniques used
by Astin (1970), Ward (1963), or Newton and Spurrell (1967). The following
models are required:

(2.1)  OUT_i = f(S_{ij} + IBV_{ik})

       where: S_{ij} is 1 if student i is in room j,
              zero otherwise,

              IBV_{ik} is the score for student i on
              background variable k,

              OUT_i is the outcome score for student i.

(2.2)  OUT_i = f(IBV_{ik})

(2.3)  OUT_i = f(S_{ij})

(2.4)  OUT_i = f(MTBA_j + MTBB_j + MTBC_j)

       where: MTBA_j to MTBC_j are the classroom average
              scores on the three teacher behavior factors
              for the teacher in classroom j.

(2.5)  OUT_i = f(MTBA_j + MTBB_j + MTBC_j + IBV_{ik})



Model 2.1 is referred to as a "full" model; it represents all
differences that exist between rooms as well as the student background
characteristics. Model 2.2 places a restriction on the full model such that
differences between rooms are assumed to be zero; student outcomes are
predicted solely as a function of individual background characteristics.
Model 2.3 is the multiple regression analog of a one-way ANOVA. It attempts
to explain differences between individual student outcomes in terms of
unspecified differences between rooms. Model 2.3 signifies the upper limit
on the amount of variance in the outcome that can be explained by any sort
of variance in the classroom context. Assuming that this variance is large
enough to be of interest, the researcher may wish to know how much of the
variance in the outcome can be attributed to his particular measure of the
classroom environment. This estimate is obtained by replacing the 1's in
model 2.3 with measures of the classroom context, resulting in model 2.4.
Any difference in the predictive efficiency (RSQ) of models 2.3 and 2.4
indicates the extent to which the replacement measures do not fully represent
all differences between rooms in terms of the dependent variable. Model 2.5
predicts student outcomes from knowledge of specific teacher behaviors and
individual background data. The reduction in explained variance from
model 2.1 to model 2.5 is a further indication that the teacher behavior
measures do not fully account for the variance attributable to the classroom
context.
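
The relationships among models 2.1 through 2.5 can be illustrated with a minimal sketch on simulated data. Everything below is an assumption made for illustration: the data are random, not the study's; the sample layout (14 rooms of 15 students) and the array names S, IBV, MTB and OUT are hypothetical labels for the room dummies, the background variables, the room-level teacher behavior scores and the outcome.

```python
import numpy as np

def r_squared(predictors, y):
    """R-squared (RSQ) from an ordinary least squares fit with an intercept."""
    X = np.column_stack([np.ones(len(y)), predictors])
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coef
    return 1.0 - (resid @ resid) / np.sum((y - y.mean()) ** 2)

rng = np.random.default_rng(0)
n_rooms, per_room = 14, 15                      # hypothetical layout
room = np.repeat(np.arange(n_rooms), per_room)  # room j of each student i

# S: one 0/1 dummy per room, with room 0 as the reference category.
S = (room[:, None] == np.arange(1, n_rooms)).astype(float)
# IBV: two simulated individual background variables.
IBV = rng.normal(size=(room.size, 2))
# MTB: three simulated teacher behavior scores, constant within a room.
MTB = rng.normal(size=(n_rooms, 3))[room]
# OUT: simulated outcome with background, room and random components.
OUT = (IBV @ np.array([1.0, 0.5])
       + rng.normal(scale=0.5, size=n_rooms)[room]
       + rng.normal(size=room.size))

r2 = {
    "2.1": r_squared(np.column_stack([S, IBV]), OUT),    # rooms + background
    "2.2": r_squared(IBV, OUT),                          # background only
    "2.3": r_squared(S, OUT),                            # rooms only
    "2.4": r_squared(MTB, OUT),                          # teacher behavior only
    "2.5": r_squared(np.column_stack([MTB, IBV]), OUT),  # teacher behavior + background
}
for name, value in r2.items():
    print(name, round(value, 4))
```

Because the teacher behavior scores are constant within a room, they are linear combinations of the room dummies. Model 2.4 can therefore never explain more variance than model 2.3, nor model 2.5 more than model 2.1, which is why the dummy-variable models serve as an upper limit.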

Results: 

Table 1 contains the means, standard deviations and correlations for
the variables employed in the study.

Model 2.1 (Table II), with all of the variables included, accounts for
approximately 73% of the variance in math achievement. A comparison of
model 2.2 with model 2.1 indicates that only 6.2% of the total variance can
be attributed to differences between rooms over and above differences
associated with student background characteristics. Indeed, under ideal
circumstances, where student background can be assumed to be unrelated to
room assignment, only 15% (model 2.3) of the variance in student achievement
is associated with any differences between rooms.

Fifteen percent isn't much, and six percent is even less, but these figures
look like other estimates of the amount of variance in achievement that lies 
between school units. And, their size is NOT a function of inadequate 
measures of teacher behavior. 

Finally, we note that about 58% of the variance in student achievement
is associated with the student background measures, over and above differences
between rooms. Some 9% of the variance is associated with the joint effect of
differences between rooms and student characteristics. In other words, in
this sample it is impossible to disentangle a part of the background and
room effects.
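
The unique and joint components quoted above follow from simple differences among the RSQ values of the three models. A minimal sketch is given below; the function name is introduced for illustration, and the RSQ values for models 2.2 and 2.3 (66.84 and 15.02) are back-calculated from the Year 1 math figures in Table II rather than reported directly.

```python
def partition(r2_full, r2_background_only, r2_rooms_only):
    """Split explained variance into unique and shared parts for two
    sets of predictors (illustrative helper, not from the paper)."""
    unique_background = r2_full - r2_rooms_only
    unique_rooms = r2_full - r2_background_only
    overlap = r2_background_only + r2_rooms_only - r2_full
    return unique_background, unique_rooms, overlap

# Year 1 math achievement, in percent of total variance:
# model 2.1 = 72.98, model 2.2 = 66.84 (derived), model 2.3 = 15.02 (derived).
parts = partition(72.98, 66.84, 15.02)
print(tuple(round(p, 2) for p in parts))  # (57.96, 6.14, 8.88)
```

The three components sum to the RSQ of the full model. The overlap term is not constrained to be positive, so a small negative value can arise when the two sets of predictors act as suppressors for one another.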

A second step in the analysis involves substituting measures of teacher
behavior which describe the rooms for the dummy variables which simply
indicate in which room a student is located. (The number of descriptors
should be less than the number of rooms if degrees of freedom are not to be
exceeded.) To accomplish this, we used three measures of teacher behavior:
carping criticism, teacher warmth and the extent to which the teacher extends
freedom and autonomy to the students. Once again, variance partitioning
shows that a large amount of the variance in math achievement is associated
with background, over and above the measures of teacher behavior. About 2%
is associated with teacher behavior over and above background effects. Only
1% is shared between the two sets of predictors.

Table II

Partitioning of Variance in Math Achievement
into Classroom and Background Related
Sources - Year 1

Full Model [model 2.1]                               72.98%
Unique to Background [model 2.1 - model 2.3]         57.96%
Unique to Classrooms [model 2.1 - model 2.2]          6.14%*
Overlap                                               8.88%

Partitioning of Variance in Math Achievement
into Teacher Behavior and Background
Related Sources - Year 1

Full Model [model 2.5]                               68.74%
Unique to Background [model 2.5 - model 2.4]         65.82%
Unique to Teacher Behavior [model 2.5 - model 2.2]    1.90%
Overlap                                               1.02%

*p<.01


By comparing models 2.3 and 2.4 we see that while 15% of the variance in
math achievement lies between rooms, only 3% can be attributed to teacher
behavior. Either these measures of teacher behavior are not what cause the
differences in the outcome, or the measures of behavior are inadequate.
For the moment we can accept either interpretation, for the finding is not
as important as the fact that we have outlined an easy way to assess the
adequacy of our measures of teacher behavior. Models 2.1 and 2.3 are a
standard against which the measures can be evaluated; they require no
assumptions about what is being measured on the independent side of the
regression equation.

Substantively, we obtain the same kind of results when student
satisfaction is used as the dependent variable (Table III). Not much
variance is attributable to rooms; fully 30% is explained by the background
variables, and there is some, but not much, overlap between rooms and
background. Much less of the total variance is attributable to variables in
the full model (38% against 73% for math). However, Figure I highlights an
important problem: the rooms which had high satisfaction scores are not the
same as those which had high math scores. (The rank-order correlation is
.1308 (p>.05).) It seems probable, then, that activities that might lead
to high math scores might not contribute to, or even reduce, satisfaction
scores. This observation represents a problem that requires extensive
treatment in its own right. For now, we will proceed on the assumption that
teachers are willing to maximize one output while perhaps seeking only to
avoid serious problems with other outcome(s) that might be considered. In
the interest of parsimony we will confine our subsequent analyses to the math
outputs.

Table III

Partitioning of Variance in Satisfaction with School
into Classroom and Background Related
Sources - Year 1

Full Model [model 2.1]                               38.66%
Unique to Background [model 2.1 - model 2.3]         30.01%*
Unique to Classroom [model 2.1 - model 2.2]           6.41%
Overlap                                               2.24%

Partitioning of Variance in Satisfaction with School
into Teacher Behavior and Background Related
Sources - Year 1

Full Model [model 2.5]                               34.49%
Unique to Background [model 2.5 - model 2.4]         25.84%*
Unique to Teacher Behavior [model 2.5 - model 2.2]    2.24%
Overlap                                               6.41%

*p<.01

Figure I

Classroom Deviations from Mean Satisfaction and Math Scores - Year 1


If the model uncovered by variance partitioning cannot be used on other
sets of data, its practical significance is indeed limited. One way to
examine the stability of the model is to compare equations derived from a
second year of data collected with the same sample of students as they moved
into the sixth grade. The relevant second year equations are 3.2, 3.4 and
3.5 in Table IV. These contain information about background and teacher
behavior in the second year in all three possible combinations. (Their
equivalents are models 2.2, 2.4 and 2.5 from the first year of data.) From
a comparison of Tables II and IV it is evident that the results of the
variance partitioning (in math achievement) between these two sets of variables
do not change substantially from the first year to the second. An F-test
of the statistical significance of the difference in the amount of variance
attributable to teacher behavior and the background variables reveals that
the differences from year one to year two are not statistically significant
at the .05 level. (See Table VI.)

Conclusion from the Variance Partitioning

From all of the above it seems that teacher behavior, as measured in
this study, has a trivial effect on both math achievement and satisfaction
with school. Moreover, it does not seem likely that any measure of the
classroom environment can account for more than 15% of the variance in math
achievement or 8% of the variance in satisfaction with school.

Table IV

Partitioning of Variance in Math Achievement
into Classroom and Background Related
Sources - Year 2

Full Model [model 3.1]                               70.43%
Unique to Background [model 3.1 - model 3.3]         58.26%*
Unique to Classrooms [model 3.1 - model 3.2]          8.73%*
Overlap                                               3.44%

Partitioning of Variance in Math Achievement
into Teacher Behavior and Background
Related Sources - Year 2

Full Model [model 3.5]                               66.95%
Unique to Background [model 3.5 - model 3.4]         62.49%*
Unique to Teacher Behavior [model 3.5 - model 3.2]    5.25%
Overlap                                              -.0079%

*p<.01

Table V

Partitioning of Variance in Satisfaction with School
into Classroom and Background Related
Sources - Year 2

Full Model [model 3.1]                               35.72%
Unique to Background [model 3.1 - model 3.3]         20.35%
Unique to Classrooms [model 3.1 - model 3.2]         11.17%
Overlap                                               4.20%

Partitioning of Variance in Satisfaction with School
into Teacher Behavior and Background Related
Sources - Year 2

Full Model [model 3.5]                               32.04%
Unique to Background [model 3.5 - model 3.4]         20.51%*
Unique to Teacher Behavior [model 3.5 - model 3.2]    7.49%*
Overlap                                               4.04%

b) Parameter Estimation

If we are willing to assume that our model of teacher behavior and
student achievement is well specified, that the relevant variables have been
included in the model and that our measures are relatively accurate, we are
warranted in using regression analysis to generate the parameters of our
model. A distinct advantage of using regression analysis
in this manner lies in the interpretation of regression weights (the
parameters or regression coefficients). For example, a regression weight of
-.1790 attached to the first dimension of teacher behavior, carping criticism
(Table VII, model 2.5), permits us to make the following statement: for every
increase of one point in the average student perception of teacher behavior
(carping criticism) there is a decrease of .18 points in the student's
post-test math achievement. Beta weights, which are nothing more than
standardized regression coefficients, are open to a similar interpretation,
except that a change in the outcome variable due to a predictor is reported
in standard (z) scores. Except for the case when the predictor variables are
orthogonal, we cannot generate equivalent statements by considering the
unique and overlapping parts of the partitioned variance.
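
The relation between B weights and beta weights can be shown with a minimal sketch on simulated data. The data, the coefficient values and the function name are assumptions made for illustration; only the conversion itself (beta = B times the predictor's standard deviation, divided by the outcome's standard deviation) is standard.

```python
import numpy as np

def ols_weights(X, y):
    """Return (B, beta): raw and standardized slopes from an OLS fit."""
    design = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    B = coef[1:]  # drop the intercept
    # A beta weight rescales B into standard deviation units.
    beta = B * X.std(axis=0, ddof=1) / y.std(ddof=1)
    return B, beta

rng = np.random.default_rng(0)
# Three simulated predictors measured on different scales.
X = rng.normal(size=(187, 3)) * np.array([1.0, 5.0, 0.2])
# Hypothetical true slopes; -0.18 echoes the size of the example above.
y = X @ np.array([-0.18, 0.10, 2.0]) + rng.normal(size=187)

B, beta = ols_weights(X, y)
print("B weights:   ", np.round(B, 3))
print("beta weights:", np.round(beta, 3))
```

B weights answer "how many outcome points per one point of the predictor", which is what a cost comparison needs; beta weights put predictors measured on different scales on a common footing.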

Consider the needs of a district superintendent, faced with the problem
of improving the academic achievement of the students in his various schools.
He hires a consultant to advise him on the most promising course of action.
The consultant collects a variety of measures pertaining to the organizational
structure of the school, actual teacher behaviors, number of books per
student, teacher salaries, and the number of remedial programs per x number
of students, in addition to data on student background characteristics. If
the data is analyzed by partitioning the variance among the various factors,
the results will indicate the relative importance of the variables or sets
of variables. The estimation of the parameters of the models, on the other
hand, will provide, in addition, an estimate of the expected increase in the
outcome per standard unit increase in a particular predictor variable. The
administrator is then in a position to consider simultaneously the investment
(in dollars or other terms) required to manipulate the predictor variable
and the expected improvement in the outcome given that investment. It may
turn out, for example, that although the largest increase in the outcome
measure can be expected as a result of increasing, by one, the number of books
per pupil, an even larger increment can be effected, for the same cost, by
increasing five-fold the number of remedial reading programs per x number of
students.

There is a potential difficulty in working with regression weights which
is evident even from our example. When we employ a measure such as number of
books per pupil as a quality index for the school, it is possible that the
measure is nothing more than a proxy for the wealth of the schools, and by
definition, the wealth (or SES) of the students. Unfortunately, by
manipulating the variable number of books per pupil, we will probably not be
able to effect the desired outcome. This is an issue that Mood (1971) alludes
to in his defense of variance partitioning as the appropriate statistical tool
for analyzing the schooling process.

An interesting and informative application of these procedures would be
to run the regression equations (and estimate the parameters) separately for
various groups of students. For instance, if the researcher or practitioner
believed that the various dimensions of teacher behavior affected different
racial groups differently, he might construct two models, one for whites and
another for blacks. A comparison of the beta weights (for teacher behaviors)
would indicate whether this hypothesis were, in fact, tenable.

Results 

Tables VII through X present the results of the parameter estimation for
years one and two for the two outcomes, math achievement and satisfaction
with school.

Table VII, model 2.5, suggests that the pre-test measure is the most
important predictor of post-test math achievement in year one. The other
background variables make a comparatively small contribution to the outcome.
(In the event that the reading pre-test measure were dropped from the
equation, a variable such as IQ, which is highly correlated with the pre-test
measure, would assume more importance.) The teacher behavior measures,
carping criticism, teacher warmth and freedom or autonomy, all make small
negative contributions to the outcome. However, it is not necessary to be
overly concerned with these figures (which perhaps do not correspond to our
expectations). The B weights for these variables are not significant
(p>.05), i.e., more than five times out of a hundred we would obtain B
weights such as these by chance.

At this point, we are again faced with the problem of evaluating the
stability of our proposed model. If we simply examine the regression weights
attached to the variables in model 2.5 (year one, Table VII) and model 3.5
(year two, Table IX), we see some rather dramatic differences. While the
pre-test measure is once again the most important variable in predicting the
outcome in year two, the B coefficient is smaller in year two than in year
one. The B weights attached to sex and number of siblings have changed
direction. The most noticeable changes, however, have occurred in the
parameters associated with the three dimensions of teacher behavior; the
size and direction of the B weights have changed substantially for all three
measures.

We applied a test (Huang, 1970) of the constancy of a subset of
regression coefficients (B weights) to the measures of teacher behavior
predicting math achievement. We rejected the null hypothesis (H0: B1 = B2)
at the .01 level of significance (F = 9.68, d.f. = 3, 354). We applied the
same test to the background variables; the null hypothesis (H0: B1 = B2)
could not be rejected (F = 1.3363, d.f. = 6, 354).
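
A minimal sketch of one such test is given below. It assumes that the test takes the usual restricted-versus-unrestricted form (a general linear hypothesis F test); this may differ in detail from the formulation in Huang (1970). The data are simulated, and the layout of 9 predictors plus an intercept per year is an assumption chosen because it is consistent with the reported degrees of freedom (187 + 187 - 20 = 354).

```python
import numpy as np

def sse(X, y):
    """Residual sum of squares from an OLS fit of y on the columns of X."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coef
    return float(resid @ resid)

def subset_constancy_f(X1, y1, X2, y2, subset):
    """F statistic and d.f. for H0: the slopes in `subset` are equal in
    both samples. X1 and X2 hold the same predictors, no intercept column."""
    n1, k = X1.shape
    n2 = X2.shape[0]
    subset = list(subset)
    others = [c for c in range(k) if c not in subset]

    # Unrestricted: every coefficient is free to differ between samples.
    sse_u = (sse(np.column_stack([np.ones(n1), X1]), y1)
             + sse(np.column_stack([np.ones(n2), X2]), y2))

    # Restricted: intercepts and untested slopes stay sample-specific,
    # but the tested slopes are forced to be common to both samples.
    own1 = np.column_stack([np.ones(n1), X1[:, others]])
    own2 = np.column_stack([np.ones(n2), X2[:, others]])
    top = np.column_stack([own1, np.zeros((n1, own2.shape[1])), X1[:, subset]])
    bottom = np.column_stack([np.zeros((n2, own1.shape[1])), own2, X2[:, subset]])
    sse_r = sse(np.vstack([top, bottom]), np.concatenate([y1, y2]))

    q = len(subset)
    df2 = n1 + n2 - 2 * (k + 1)
    f_stat = ((sse_r - sse_u) / q) / (sse_u / df2)
    return f_stat, (q, df2)

rng = np.random.default_rng(1)
n, k = 187, 9                      # assumed: 9 predictors + intercept per year
X1, X2 = rng.normal(size=(n, k)), rng.normal(size=(n, k))
b1 = np.ones(k)
b2 = b1.copy()
b2[:3] += 2.0                      # hypothetical shift in the first three slopes
y1 = X1 @ b1 + rng.normal(size=n)
y2 = X2 @ b2 + rng.normal(size=n)

f_stat, df = subset_constancy_f(X1, y1, X2, y2, subset=[0, 1, 2])
print(df)  # (3, 354)
```

The statistic is referred to an F distribution with the returned degrees of freedom. Testing the six remaining predictors instead of the first three gives d.f. of 6 and 354, matching the second test reported above.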

In summary, the B weights for teacher behavior changed significantly
over the period in question. The background measures, on the other hand,
behaved consistently over the two samples. We acknowledge that our
interpretation of these F statistics is suspect: the measurement instruments
are relatively unsophisticated, making it extremely difficult to determine
whether the change (and stability) we have verified statistically are, in
fact, spurious. Regardless, we feel that the value of the procedure is that
it forces educational researchers to consider the issue of the stability of
their proposed models.